GRAPES-DD: exploiting decision diagrams for index-driven search in biological graph databases
نویسندگان
چکیده
Abstract Background Graphs are mathematical structures widely used for expressing relationships among elements when representing biomedical and biological information. On top of these representations, several analyses performed. A common task is the search one substructure within graph, called target. The problem referred to as one-to-one subgraph search, it known be NP-complete. Heuristics indexing techniques can applied facilitate search. Indexing also exploited in context searching a collection target graphs, one-to-many problem. Filter-and-verification methods that use approaches provide fast pruning graphs or parts them do not contain query. expensive verification phase then performed only on subset promising targets. strategies extract graph features at sufficient granularity level performing powerful filtering step. Features memorized data allowing an efficient access. size, querying time power key points development solutions. Results An existing approach, GRAPES, has been shown have good performance terms speed-up both cases. However, suffers size built index. For this reason, we propose GRAPES-DD, modified version GRAPES which structure replaced with Decision Diagram. Diagrams broad class encode manipulate functions efficiently. Experiments synthetic confirmed our expectation showing GRAPES-DD substantially reduced memory utilization compared without worsening time. Conclusion biochemical completely new potentially thanks their ability compactly sets by exploiting regularity, entire once, instead exploring each single element explicitly. Search based Diagram makes only, more affordable us deal huge ever growing collections structures.
منابع مشابه
Graph-driven Exploration of Relational Databases for Efficient Keyword Search
Keyword-based search is becoming the standard way to access any kind of information and it is considered today an important add-on of relational database management systems. The approaches to keyword search over relational data usually rely on a two-step strategy in which, first, tree-shaped answers are built by connecting tuples matching the given keywords and, then, potential answers are rank...
متن کاملSymmetry-Driven Decision Diagrams for Knowledge Compilation
In this paper, symmetries are exploited for achieving significant space savings in a knowledge compilation perspective. More precisely, the languages FBDD and DDG of decision diagrams are extended to the languages Sym-FBDDX,Y and Sym-DDGX,Y of symmetry-driven decision diagrams, where X is a set of ”symmetry-free” variables and Y is a set of ”top” variables. Both the time efficiency and the spac...
متن کاملSearch Algorithms for Conceptual Graph Databases
We consider a database composed of a set of conceptual graphs. Using conceptual graphs and graph homomorphism it is possible to build a basic query-answering mechanism based on semantic search. Graph homomorphism defines a partial order over conceptual graphs. Since graph homomorphism checking is an NP-Complete problem, the main requirement for database organizing and managing algorithms is to ...
متن کاملExploiting Belief State Structure in Graph Search
It is well-known that eliminating repeated states is essential for efficient search of state-space AND-OR graphs. The same technique has been found useful for searching beliefstate AND-OR graphs, which arise in nondeterministic partially observable planning problems and in partially observable games. Whereas physical states are viewed by search algorithms as atomic and admit only equality tests...
متن کاملFast Search Methods for Biological Sequence Databases
Biology researchers have a pressing need for data management technologies which will make the storage and retrieval of DNA and protein sequence data accurate and e cient. The volume of data generated by DNA sequencing is already massive and will continue to grow rapidly. Even if the current sequence databases are adequate today, they most assuredly will become inadequate in the future when far ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: BMC Bioinformatics
سال: 2021
ISSN: ['1471-2105']
DOI: https://doi.org/10.1186/s12859-021-04129-0